SEQUENCE LISTING 



(1) GENERAL INFORMATION': 

(i) APPLICANT: KABUSHIKI KAISHA HAY ASH I BARA SEIBUTSU KAGAKU 
KENKYUJO 

(ii) TITLE OF INVENTION: POLYPEPTIDE HAVING jS- FRUCTOFURANOS IDASE 
ACTIVITY 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE - ADDRESS : 

(A) ADDRESSEE: BROWDY AND NEIMARK 

(B) STREET: 419 Seventh Street, N.W., Suite 300 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent Xn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/870,827 

(B) FILING DATE: 06 - JUN- 1 99 7 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 170,630/1996 

(B) FILING DATE: 10- JUN- 1996 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: BROWDY , Roger L. 

(B) REGISTRATION NUMBER: 25,618 

(C) REFERENCE/DOCKET NUMBER: TSUSAKI=2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 

(B) TELEFAX: 202-737-3528 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(iii) Fragment type: N- terminal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His lie Thr 
! 5 10 15 

Arg Ala Asp Met Leu 
20 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



(iii) Fragment type: internal fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ser Val Phe Asp Gly Gly Asp Gly Thr Val Tyr Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Met 
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Ser 


Gly 


Asp 
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lie 
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Trp 
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Ala 
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Tl- 
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Lys 


Val 


Asp 


Gly 


Val 


Ser 


Asp 


His 


Lys 


Ser 


Val 
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Gly 


Asp 


Gly 
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Lys 


Trp 
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His 


Thr 
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Asp 
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Tyr 


Val 


Glu 


Asp 


Lys 
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Thr 
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Gly 
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Glu 
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Ala 
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Met 
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Trp 
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Met 


Thr 
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Asp 


Gly 


He 


Asn 
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Lys 
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Val 


Tyr 


Met 


Leu 


Gly 


Pro 


Gly 


Gly 


Asp 
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360 
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Ser 


Leu 


Asn 


Gly 


Pro 


His 


Asn 


Pro 


He 


Asn 


Glu 


Thr 


Gly 


Leu 


Val 


Leu 



370 375 380 



Asn Met Asn Leu Asp Pro Ala Asp Leu Thr His Thr Tyr Ser His Cys 
385 390 395 400 

Gly lie Pro His Pro Glu Gly Asn Asn Val Val Leu Thr Ser Tyr Met 

405 410 415 

Thr Asn Arg Gly Phe Tyr Pro Glu His His Ser His Leu Arg Asp Lys 

420 425 430 

Leu Gly Val Asn lie Lys Gly Ser Asp Thr Ser Gly Gly Glu Asn Ser 

435 440 445 

Ser Gly Gin Gly Gin Phe Pro 
450 455 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

ATG AAC AGC GGG GAC TAC AAG GAA GAC TAT GGT TTT GCC CAT ATT ACA 48 
Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His lie Thr 
15 10 15 

□ CGC GCT GAC ATG CTA AAA ATT CCA GGA CAA CAA AAC AGT CCT CAA TTT 96 

yy Arg Ala Asp Met Leu Lys lie Pro Gly Gin Gin Ash Ser Pro Gin Phe 

. -r - 20 25 , 30 

5 AAA GTG CCT CAA TTC AAT GCA TCA GCA ATC AAA AAC ATT GAT TCG GCA 144 

01 Lys Val Pro Gin Phe Asn Ala Ser Ala lie Lys Asn lie Asp Ser Ala 

m 35 40 45 

¥ - 

!jS AAA GGG TAT GAT AAG TCA GGC AAC TTA ATA GAT TTA GAT GTA TGG GAT 192 

Lys Gly Tyr Asp Lys Ser Gly Asn Leu lie Asp Leu Asp Val Trp Asp 
a 50 55 60 

L.: AGC TGG CCA CTG CAA AAC GCT GAT GGT ACT GCG GCA AAT TAT CAT GGA 24 0 

i Ser Trp Pro Leu Gin Asn Ala Asp Gly Thr Ala Ala Asn Tyr His Gly 
W 65 70 75 80 



TAT CAC ATC GTC TCC GCT TTA GCA GGT GAC CCA AAA AAC AGT GAT GAT 2 88 

Tyr His lie Val Ser Ala Leu Ala Gly Asp Pro Lys Asn Ser Asp Asp 
85 90 95 

ACT CCA CTT CAT TTA TTC TAT CAA AAA GTC GGT GAT ACA TCG ATT GAC 33 6 

Thr Pro Leu His Leu Phe Tyr Gin Lys Val Gly Asp Thr Ser lie Asp 

100 105 110 115 

AGC TGG AAA AAT GCT GGA AGA GTA TTT GAA GAT ATG GAT AAA TTT GTT 3 84 

Ser Trp Lys Asn Ala Gly Arg Val Phe Glu Asp Met Asp Lys Phe Val 
120 125 130 

CCA AAT GAT CCG TAT CTT AAA TAT CAA ACA CAG GAG TGG TCA GGT TCT 4 32 

Pro Asn Asp Pro Tyr Leu Lys Tyr Gin Thr Gin Glu Trp Ser Gly Ser 
135 140 145 

GCT ACT TTA ACC AAA GAT GGC CAA GTC CGT TTA TTC TAT ACA GAT TAC 4 80 

Ala Thr Leu Thr Lys Asp Gly Gin Val Arg Leu Phe Tyr Thr Asp Tyr 
150 155 160 

TCA GGT AAT CCT GAA GAT GGT GGA ACC GGT GCT GGT AAC CAA ATC ATT 52 8 

Ser Gly Asn Pro Glu Asp Gly Gly Thr Gly Ala Gly Asn Gin lie lie 
165 170 175 
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TCA ACT GCT CAA GTA AAC TTA TCC CAG CCG GAT GCA GCT ACA CTT AAA 576 
Ser Thr Ala Gin Val Asn Leu Ser Gin Pro Asp Ala Ala Thr Leu Lys 
180 185 190 195 

GTC GAT GGA GTA TCT GAT CAT AAA TCT GTC TTT GAT GGC GGA GAC GGT 624 
Val Asp Gly Val Ser Asp His Lys Ser Val Phe Asp Gly Gly Asp Gly 
200 205 210 

ACA GTT TAT CAA AAT ATT CAG CAA TTT ATC GAT GAA GGC AAG TGG ATT 672 
Thr Val Tyr Gin Asn lie Gin Gin Phe lie Asp Glu Gly Lys Trp lie 
215 220 225 

TCA GGT GAT AAC CAT ACT TTA AGA GAC CCT CAC TAT GTT GAA GAT AAG 72 0 
Ser Gly Asp Asn His Thr Leu Arg Asp Pro His Tyr Val Glu Asp Lys 
230 235 240 

GGC CAT AAA TAT CTT GTC TTT GAA GCG AAT ACT GGA ACA ACA GAT GGT 768 
Gly His Lys Tyr Leu Val Phe Glu Ala Asn Thr Gly Thr Thr Asp Gly 
245 250 255 

TAT CAA GGC GAT CAG TCT TTC AAT AAT AAA GCT TAC TAT GGC GGA AGT 816 
Tyr Gin Gly Asp Gin Ser Phe Asn Asn Lys Ala Tyr Tyr Gly Gly Ser 
260 265 270 275 

GAC GTC TTC TTC CAG AAT GAA AAA AAT AAA CTG CTT CAA AGT CCT AAA 864 
Asp Val Phe Phe Gin Asn Glu Lys Asn Lys Leu Leu Gin Ser Pro Lys 
280 285 290 

AAA CAA ATT GCT TCT TTA GCG AAT GGT GCA TTA GGC ATT GTT GAA TTG 912 
Lys Gin He Ala Ser Leu Ala Asn Gly Ala Leu Gly He Val Glu Leu 
295 300 305 

GCC GAT GAC TAT ACA GTG AAA AGT GTT ATG AAA CCA TTA GTC GCA TCA 960 
Ala Asp Asp Tyr Thr Val Lys Ser Val Met Lys Pro Leu Val Ala Ser 
310 315 320 

AAC ACA GTA GCA GAT GAA GTC GAA CGC GCC AAT ATA TTT AAA ATG AAT 1008 
Asn Thr Val Ala Asp Glu Val Glu Arg Ala Asn He Phe Lys Met Asn 
325 330 335 

AAT AAA TGG TAT CTA TTC ACG GAT TCA AGA GGA TCC AAA ATG ACG AGT 1056 
Asn Lys Trp Tyr Leu Phe Thr Asp Ser Arg Gly Ser Lys Met Thr Ser 
340 345 350 355 

GAT GGA ATT AAC GAC AAA GAT GTT TAT ATG CTA GGG CCC GGA GGC GAC 1104 
Asp Gly He Asn Asp Lys Asp Val Tyr Met Leu Gly Pro Gly Gly Asp 
360 365 370 

TCC TTA AAT GGC CCA CAC AAC CCG ATA AAT GAA ACT GGA CTT GTA TTG 1152 
Ser Leu Asn Gly Pro His Asn Pro He Asn Glu Thr Gly Leu Val Leu 
375 380 385 

AAC ATG AAT CTT GAC CCT GCT GAT CTC ACA CAC ACT TAC TCT CAT TGC 12 00 
Asn Met Asn Leu Asp Pro Ala Asp Leu Thr His Thr Tyr Ser His Cys 
390 395 400 

GGT ATC CCG CAC CCT GAA GGT AAT AAT GTG GTA CTC ACA AGT TAT ATG 124 8 
Gly He Pro His Pro Glu Gly Asn Asn Val Val Leu Thr Ser Tyr Met 
405 410 415 

ACG AAT AGA GGC TTC TAT CCA GAA CAT CAC TCT CAC CTG CGG GAC AAG 12 96 
Thr Asn Arg Gly Phe Tyr Pro Glu His His Ser His Leu Arg Asp Lys 
420 425 430 435 

CTT GGG GTT AAT ATT AAA GGG TCT GAC ACA TCT GGA GGA GAA AAT AGT 1344 
Leu Gly Val Asn He Lys Gly Ser Asp Thr Ser Gly Gly Gltr ^sn Ser 
440 445 450 
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M9 



m 



3 
f 



TCC GGA CAA GGA CAA TTC CCA 1365 
Ser Gly Gin Gly Gin Phe Pro 
455 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus sp. 

(B) INDIVIDUAL ISOLATE: V230 (FERM BP-5054) 

(ix) FEATURE: 

(A) NAME /KEY : 5 ' UTR 

(B) LOCATION: 1..360 

(C) IDENTIFICATION METHOD: E 

(A) NAME /KEY : signal peptide 

( B ) LOCATION : 3 6 1 . . 4 5 6 

(C) IDENTIFICATION METHOD: S 



(A) NAME/KEY: mat peptide 

(B) LOCATION: 457.. 1821 

(C) IDENTIFICATION METHOD: S 

y s 

rf= (A) NAME /KEY : 3 ' UTR 

(B) LOCATION: 1822.. 2408 

(C) IDENTIFICATION METHOD: E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



r? CGGGGAAAAT ACTAGATTCC AATTGGCCAG ACTTCCCAGT TGGTGTAAGA GAAGAGTTCG 6 0 

GACTGCCAAT GCAGCTGTGC GTAAGAAAAC AGCTTACTCA TGAGCAATTA CTAGAAGAAT 12 0 

Q TTCAAAAGTC CTGGGATAAG GCCAAGTCCA CTTTGAAATA AACTTTTCAG CCTCTGTGTG 18 0 

yQ GGGGCTTTTT TGTTTTTATT TATTTCAACT GCAAGTGGTC CATCCCCTAT ATCAATTTAA 24 0 

f*jj GACGAAATTC TAATCAATCC ATGCCATCCC CAATAAACTC GTCCTCCTCT AT AC TTTTAA 300 

°ff TTAATAAGAA ACTATCAAGA GCTTTCTTAT CAAATTCATA CATATCCAAG GAGGGAGACG 3 60 

H" ATG AAC TTC AAA AGA TTG GCG AAA AAA GCA GCT GCC GTA ACC TTC AGG 408 
Met Asn Phe Lys Arg Leu Ala Lys Lys Ala Ala Ala Val Thr Phe Arg 
-30 -25 -20 

ACT GCT ATA TTA GTA GGA GCG GAC GGA CCG CAT ATT TTT GCG CAG CAA 456 
Thr Ala lie Leu Val Gly Ala Asp Gly Pro His lie Phe Ala Gin Gin 
-15 -10 -5 

ATG AAC AGC GGG GAC TAG AAG GAA GAC TAT GGT TTT GCC CAT ATT ACA 5 04 
Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His lie Thr 
15 10 15 

CGC GCT GAC ATG CTA AAA ATT CCA GGA CAA CAA AAC AGT CCT CAA TTT 5 52 
Arg Ala Asp Met Leu Lys lie Pro Gly Gin Gin Asn Ser Pro Gin Phe 
20 25 30 

AAA GTG CCT CAA TTC AAT GCA TCA GCA ATC AAA AAC ATT GAT TCG GCA 600 
Lys Val Pro Gin Phe Asn Ala Ser Ala lie Lys Asn lie Asp Ser Ala 
35 40 45 

AAA GGG TAT GAT AAG TCA GGC AAC TTA ATA GAT TTA GAT GTA TGG GAT 64 8 
Lys Gly Tyr Asp Lys Ser Gly Asn Leu lie Asp Leu Asp Val Trp Asp 
50 55 60 
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AGC TGG CCA CTG CAA AAC GCT GAT GGT ACT GCG GCA AAT TAT CAT GGA 696 
Ser Trp Pro Leu Gin Asn Ala Asp Gly Thr Ala Ala Asn Tyr His Gly 
65 70 75 80 . 

TAT CAC ATC GTC TCC GCT TTA GCA GGT GAC CCA AAA AAC AGT GAT GAT 744 
Tyr His He Val Ser Ala Leu Ala Gly Asp Pro Lys Asn Ser Asp Asp 
85 90 95 

ACT CCA CTT CAT TTA TTC TAT CAA AAA GTC GGT GAT ACA TCG ATT GAC 7 92 

Thr Pro Leu His Leu Phe Tyr Gin Lys Val Gly Asp Thr Ser He Asp 
100 105 110 

AGC TGG AAA AAT GCT GGA AGA GTA TTT GAA GAT ATG GAT AAA TTT GTT 840 
Ser Trp Lys Asn Ala Gly Arg Val Phe Glu Asp Met Asp Lys Phe Val 
115 120 125 

CCA AAT GAT CCG TAT CTT AAA TAT CAA ACA CAG GAG TGG TCA GGT TCT 888 
Pro Asn Asp Pro Tyr Leu Lys Tyr Gin Thr Gin Glu Trp Ser Gly Ser 
130 135 140 

GCT ACT TTA ACC AAA GAT GGC CAA GTC CGT TTA TTC TAT ACA GAT TAC 936 
Ala Thr Leu Thr Lys Asp Gly Gin Val Arg Leu Phe Tyr Thr Asp Tyr 
145 150 155 160 

TCA GGT AAT CCT GAA GAT GGT GGA ACC GGT GCT GGT AAC CAA ATC ATT 9S4 
□ Ser Gly Asn Pro Glu Asp Gly Gly Thr Gly Ala Gly Asn Gin He He 
• h 165 170 175 

H TCA ACT GCT CAA GTA AAC TTA TCC CAG CCG GAT GCA GCT ACA CTT AAA 1032 
bU Ser Thr Ala Gin Val Asn Leu Ser Gin Pro Asp Ala Ala Thr Leu Lys 
jjf 180 185 190 

m 

ZL GTC GAT GGA GTA TCT GAT CAT AAA TCT GTC TTT GAT GGC GGA GAC GGT 108 0 

%f Val Asp Gly Val Ser Asp His Lys Ser Val Phe Asp Gly Gly Asp Gly 

FU 195 200 205 

L, ACA GTT TAT CAA AAT ATT CAG CAA TTT ATC GAT GAA GGC AAG TGG ATT 1128 

f Thr Val Tyr Gin Asn He Gin Gin Phe He Asp Glu Gly Lys Trp He 
^ 210 215 220 



TCA GGT GAT AAC CAT ACT TTA AGA GAC CCT CAC TAT GTT GAA GAT AAG 1176 
Ser Gly Asp Asn His Thr Leu Arg Asp Pro His Tyr Val Glu Asp Lys 
225 230 235 240 

GGC CAT AAA TAT CTT GTC TTT GAA GCG AAT ACT GGA ACA ACA GAT GGT 1224 
Gly His Lys Tyr Leu Val Phe Glu Ala Asn Thr Gly Thr Thr Asp Gly 
245 250 255 

TAT CAA GGC GAT CAG TCT TTC AAT AAT AAA GCT TAC TAT GGC GGA AGT 1272 
Tyr Gin Gly Asp Gin Ser Phe Asn Asn Lys Ala Tyr Tyr Gly Gly Ser 
260 265 270 

GAC GTC TTC TTC CAG AAT GAA AAA AAT AAA CTG CTT CAA AGT CCT AAA 1320 
Asp Val Phe Phe Gin Asn Glu Lys Asn Lys Leu Leu Gin Ser Pro Lys 
275 280 285 

AAA CAA ATT GCT TCT TTA GCG AAT GGT GCA TTA GGC ATT GTT GAA TTG 1368 
Lys Gin He Ala Ser Leu Ala Asn Gly Ala Leu Gly He Val Glu Leu 
290 295 300 

GCC GAT GAC TAT ACA GTG AAA AGT GTT ATG AAA CCA TTA GTC GCA TCA 1416 
Ala Asp Asp Tyr Thr Val Lys Ser Val Met Lys Pro Leu Val Ala Ser 
305 310 315 320 

AAC ACA GTA GCA GAT GAA GTC GAA CGC GCC AAT ATA TTT AAA ATG AAT 14 64 
Asn Thr Val Ala Asp Glu Val Glu Arg Ala Asn He Phe Lys Met Asn 
325 330 335 
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m 



ru 



AAT AAA TGG TAT CTA TTC ACG GAT TCA AGA GGA TCC AAA ATG ACG AGT 1512 
Asn Lys Trp Tyr Leu Phe Thr Asp Ser Arg Gly Ser Lys Met Thr Ser 
340 345 350 

GAT GGA ATT AAC GAC AAA GAT GTT TAT ATG CTA GGG CCC GGA GGC GAC 1560 
Asp Gly lie Asn Asp Lys Asp Val Tyr Met Leu Gly Pro Gly Gly Asp 
355 360 365 

TCC TTA AAT GGC CCA CAC AAC CCG ATA AAT GAA ACT GGA CTT GTA TTG 1608 
Ser Leu Asn Gly Pro His Asn Pro lie Asn Glu Thr Gly Leu Val Leu 
370 375 380 

AAC ATG AAT CTT GAC CCT GCT GAT CTC ACA CAC ACT TAC TCT CAT TGC 1656 
Asn Met Asn Leu Asp Pro Ala Asp Leu Thr His Thr Tyr Ser His Cys 
385 390 395 400 

GGT ATC CCG CAC CCT GAA GGT AAT AAT GTG GTA CTC ACA AGT TAT ATG 1704 
Gly lie Pro His Pro Glu Gly Asn Asn Val Val Leu Thr Ser Tyr Met 
405 410 415 

ACG AAT AGA GGC TTC TAT CCA GAA CAT CAC TCT CAC CTG CGG GAC AAG 1752 
Thr Asn Arg Gly Phe Tyr Pro Glu His His Ser His Leu Arg Asp Lys 
420 425 430 

CTT GGG GTT AAT ATT AAA GGG TCT GAC ACA TCT GGA GGA GAA AAT AGT 1800 
Leu Gly Val Asn He Lys Gly Ser Asp Thr Ser Gly Gly Glu Asn Ser 
435 440 445 

TCC GGA CAA GGA CAA TTC CCA TA GCGATTATCT CCCAATAAAA AG AAATGT C A 1853 
Ser Gly Gin Gly Gin Phe Pro 
450 455 



CTGGCAAGAT CCTACCTTTT CCCCAACCTT TTTTTTTAAA ATAAAGGGTT TTGTACCACC 1913 
TTTAGAAGAA AAAAGAATCC TTGGCC CGGG CCAATTACCC ATTGCCGAAG GAGCTACCCA 1973 
ATAAAAAAGA AATTTTCCCT TTTTAGGGGG GGAGTCCTTT TTTTCTATCT TGGGGTGGGG 2033 
* ATTGTTGGCC CCCACCAGGG GACCTTTATT AATTTATAGC ATGAATCTGG CGATTTTGCC 2093 

1=4= TGGCCTACTT ATTATCCAAG CCGCCCAGCC AAAAATATTA AACCAAGGAA TCCTAAGAGT 2153 
AGGCGGTACA GTTTACCAAC ACCTGCTGGA GGGGATGGAT ATCGCCCTCA AT AC C ACTTT 2213 
: C TCTGTTCCCG ACAAATGGAA AAATGATCCG CAGAGGCCCA TCTTTTTTGG GGGAAGTATC 22 73 
W ATTACTATTA CCTTTACAAC AAAGACTATC CAGATGGAAA TGGTACGGAA TGGCGGCATG 2333 
HP CAACGTCCGA AGATTTATTG CATTGGACGG ACGAAGGGAT TGCCATCCCG AAGTATACCA 23 93 
£3 ATAAAAATGG TGATC 24 08 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GAYTAYAARG ARGAYTAYGG NTTYGC 26 
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